VANISHING SUMS OF mTH ROOTS OF UNITY IN FINITE 

FIELDS 



T. Y. LAM AND K. H. LEUNG 

Abstract. In an earlier work, the authors have determined all possible 
weights n for which there exists a vanishing sum & + ■"" + Cn = of 
mth roots of unity Q in characteristic 0. In this paper, the same problem 
is studied in finite fields of characteristic p. For given m and p, results are 
obtained on integers uq such that all integers n > uq are in the "weight 
set" W p (m). The main result (1.3) in this paper guarantees, under suitable 

conditions, the existence of solutions of xf H \-x% = with all coordinates 

not equal to zero over a finite field. 



1. Introduction 

By a vanishing sum of m th roots of unity, we mean an equation a\ + • • • + a n = 
where a™ = 1 for each i. The integer n is said to be the weight of this vanishing 
sum. In [LL], considering mth roots of unity in C, we denned W(m) to be the set 
of integers n > for which there exists a vanishing sum a±-\ — • + a n = as above. 
The principal result in [LL] gives a complete determination of the weight set W(m) 
(in characteristic 0), as follows. 

Theorem 1.1. For any natural number m with prime factorization p^ 1 ■ ■ ■p° r r , the 
weight set W(m) is exactly given by Npi + ■ • • + Np r . (Here and in the following, 
N:={0,1,2,...}). 

In this paper, we study vanishing sums of mth roots of unity in characteristic p. 
In analogy to the characteristic case, we define W p (m) to be the set of weights 
n for which there exists a vanishing sum cti + • • • + a n — where each ctj is an 
mth root of unity in F p , the algebraic closure of the prime field F p . Note that, if 
m = p t m' where gcd(p, m') = 1, we have x m = 1 in ¥ p iff x m ' — 1; in particular, 
W p (m) = W p (m'). Therefore, we may assume throughout that gcd(p, m) = 1, i.e. p 
is not among the prime divisors pi of m. As in the case of charactersitic 0, we have 
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Pi G W p (m) for all i. But in characteristic p, we also have p G W p (m) (due to the 
vanishing sum p ■ 1 = 0), so now 

(1.2) lf p (m)3Nj) + Npi + --- + Np r . 

Easy examples (see (2.1)) show that this need not be an equality in general, so 
we are left with no viable conjecture on the structure of the weight set W p (m) in 
characteristic p. However, (1.2) does show that, if m > 1, all sufficiently large 
integers n (in fact all n > (p — l)(pi — 1)) belong to W p (m). A more tractable 
problem will then be the determination of more accurate bounds n such that all 
integers n > n belong to W p (m). 

In this paper, we will show how such an integer no can be determined. Our work 
is divided into three cases, depending on whether gcd(p — 1, m) is 1, 2, or bigger. 
The estimates on n differ from case to case, and are given respectively in (5.6), 
(4.1), and (3.1)-(3.3). Although we have three different estimates on n , there does 
exist a (necessarily weaker) uniform estimate for all cases. In the following, we shall 
try to explain what this uniform estimate is, and why is it a reasonable one. 

A guiding principle for our work throughout is the fact that a finite field is a 
Ci-field (see [Gr]). UK — ¥ p k is a finite field containing all mth roots of unity, 
then, for d := {p k — l)/m, the mth roots of unity in W p comprise the group K d . 
Therefore, a vanishing sum of m th roots of unity of weight n corresponds precisely 
to a "good" solution of xf + ■ ■ ■ + x d = in K, where by a "good" solution we 
mean one with each Xi ^ 0. If n > d, the fact that K is C\ implies that we have 
a solution (x±, ■ ■ ■ , x n ) ^ (0, • • • , 0). It certainly seems tempting to speculate that 
there exists in fact a "good" solution (in K). If this is indeed the case, then by what 
we said earlier in this paragraph, any integer n > d will be in the weight set W p {m). 

The desired conclusion that, for n > d, xf + • • • + x d n = has a "good" solution 
in K is, however, not true in general! For instance, if d = p k — 1, then x d = 1 for 
each x G K, so we have a "good" solution for xf + • • • + x d n = in K only when 
n is a multiple of p. In a similar vein, if k — 1, p is odd, and d = (p — l)/2, then 
any nonzero dth power in K is ±1. For any odd integer n G (d, p), the equation 

xf-\ \-x d = again has no "good" solution in ¥ p . The trouble with these cases is 

that m < 2, for which we don't have "enough" mth roots to play with. As it turns 
out, as soon as we ignore the above cases, we'll have the following uniform result for 
getting "good" solutions. 

Theorem 1.3. Let K = ¥ p k and d = (p k — l)/m as above, and assume that m/1, 
(m, k) 7^ (2, 1). Then, whenever n > d, the equation xf + - — \-x d = has a "good" 
solution in K. In other words, the weight set W p {m) contains all integers > d+ 1. 

The results in §§3-5 below will cover this theorem in the case m > 3. In the case 
m = 2, (1.3) is quickly checked as follows. Since we assume in this case that k > 2, 
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we have d > (p 2 — l)/2 > p — 1. Given n > d + 1 > p, it is easy to solve the equation 
oil + • • • + OL n — with ctj = ±1, by considering the parity of n. Having disposed of 
the trivial cases m — 1, 2, we mm/ assume in §§3-6 o/ i/iis paper that m > 3. 

In the case when p is odd and d = 2, (1.3) says precisely that, for any n > 2, the 
quadratic form + • • • + x\ has a "good" zero over any finite field of more than 
five elements. This is a special case of a well-known observation of Witt for isotropic 
diagonal quadratic forms (see [Wi: p. 39], or [BS: p. 394], [La: p. 25, Ex.7]). Thus, (1.3) 
may be thought of as a generalization of Witt's result to the higher degree diagonal 
forms xf + --- + x d over finite fields. Note that d \ {p k — 1) is not a really essential 
assumption in (1.3). In dealing with the equation x\ + • • • + x e n = 0, we can replace 
the degree e by d := gcd(p fc — 1, e), and define m to be (p k — l)/d. Then K e = K d , 
so under the assumptions of (1.3), x\ + • • • + x e n = will have a "good" solution as 
long as n > d. 

In the literature, there are many results dealing with diagonal equations over finite 
fields; see, for instance, [LN], [Sch], [Sm], and more recently [QY]. Conventionally, 
one could apply algebro-geometric methods, or alternatively the method of Gauss and 
Jacobi sums. As the referee of this paper pointed out, these methods can be utilized 
to show the existence of "good" solutions to a diagonal equation xf + • • • + x d n = 
(n > 2) in W q if q is suitably large compared to d (without the condition n> d). 
However, these conventional methods do not seem to give enough information if q 
is "small" in comparison to d. In our setting, working mostly with n > d and 

taking full advantage of the additive nature of the special equation xf-\ h x d n = 0, 

we apply instead the methods of additive number theory. These methods do give 
fairly precise results, without reference to the size of ¥ q . In fact, the analysis in 
§§3-5 will not only prove (1.3), but also show that, in various cases, the equation 
xf + • • • + x d n = has a "good" solution in K often for much smaller values of n 
(than n > d + 1). Thus, the more precise results in this paper are to be found in 
(3.1)-(3.3), (4.1) and (5.6). Theorem 1.3 is only a common denominator of these 
results giving a convenient and uniform summary of the main work in this paper. 

Acknowledgment. We thank the referee of this paper, whose comments enabled us 
to rewrite more accurately the last paragraph above comparing the use of different 
methods in treating diagonal equations over finite fields. 

2. Some Basic Examples 

We shall begin with some examples and computations of the weight sets W p (m). 
The first couple of examples show that various properties of weight sets in character- 
ictic are no longer valid in characteristic p. For convenience of expressing weight 
sets, let us use the notation [n, oo)i for the set of integers > n. 
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Example 2.1. Referring to (1.2), the smallest positive element in the set W p (m) 
may not be min {p, pi, • • • ,p r }- For instance, when p — 11 and m = 5, the 5th 
roots of unity in F n are {1, 3, 9, 5, 4}. Observing that 1 + 1 + 9 = in F n , we see 
that W n (5) contains 3, which is smaller than 5 and 11. By (2.3) below, we have 
1^1(5) = {0}U [3, oo) z . Thus, not only (1.2) fails to be an equality, but also Wu(5) 
is not even of the form ^ N qi for a set of primes % 's. 

Example 2.2. Contrary to the characteristic case, the set W p (m) may be larger 
than W p (mo) where mo is the square-free part of m. For instance, let p — 5 
and m = 4, so m = 2. It is easy to see that W$(2) = {0, 2}U[4, oo)z, but 
W 5 (4) = {0}U[2, oo) z . 

Example 2.3. Let q = p a > 5 where p is an odd prime, and let m — (q — l)/2. 
Then d := (q — l)/m = 2. For any n > 3, the quadratic form X\ + • • • + X 2 is 
isotropic over ¥ q , so by the theorem of Witt referenced before, it has a "good" zero in 
F q . Therefore, n G W p (m). It follows that W p {m) = {0}U [2, oo) z if g = 1 (mod 4), 
and W p (m) = {0}U[3, oo) z if g = 3(mod4). 

Example 2.4. (F p contains all mth roots of unity.) Let p = 31, and m — 3. The 
third roots of unity are {1, 5, 25}, so the equation 25 + 6 ■ 1 = G F 3i shows that 
7 G W 3 i(3). A routine computation shows that W 31 (3) = {0, 3, 6, 7} U [9, oo) z . 

Example 2.5. (F p contains no mth roots of unity other than 1.) Let p — 2, and 
m = 73. We work in K = F 2 9 which contains all 73 rd roots of unity. By standard 
tables of irreducible polynomials over finite fields, the trinomial f(X) = X 9 + X + l 
is irreducible over F 2 , so we can take K to be ¥ 2 [X]/(f(X)). Let a := X G K. 
We have = (a 9 + a + l) 8 = a 72 + a 8 + 1, so a 73 = a(a 8 + 1) = a 9 + a = 1. 
Thus, the relation a 9 + a + 1 = shows that 3 G Vy 2 (73), and it follows easily that 
W 2 {73) = {0}U [2, oo) z . 

In the balance of this section, let us consider W p (m) in the case when m is a prime 
power (not divisible by p). Under a special hypothesis on the cyclotomic polynomial 
$ m (X), the weight set W p (m) can be determined explicitly. 

Theorem 2.6. Let m = £ a where £ is a prime different from p, and assume 
that the cyclotomic polynomial $ m (X) G 7L\X\ remains irreducible modulo p. Then 
W p {m) = Np + N£. 

Proof. Of course, it suffices to prove the inclusion "C". Let ( be a primitive 
mth root of unity in F p . Let m' := m/£, and a := ( m (a primitive £th root 
of unity). Let K = F p (^), and L = F p (a). Since $ m (X) is irreducible mod p, 
[K : F p ] = ip(m) = m'{£ — 1). From this, it is easy to see that [K : L] — w! and 
[L:¥ p ]=£-1. 

Any vanishing sum of mth roots of unity can be written in the form Y17=® 1 9iC = 
0, where each g { is a sum of £th roots of unity. Since the degree of ( over L 
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is m', the elements 1, £, • • • , £ m ' _1 are linearly independent over L. Therefore, 
each gi E L is itself a vanishing sum, and it suffices to show that its weight is in 
Np + N£. Starting over again, we are now down to considering a vanishing sum 
Si=o ai0L% = 0) where each G N. Let a, be the smallest among the 's. Since 
the minimal equation of a over F p is 1 + a + ■ ■ ■ + a e ~ 1 = 0, it follows easily 
that do = a-i — ■ • • — ap_i G F p . The weight of the vanishing sum in question is 
^\ Oj = £dj (mod p). Since £\ Oj > £aj, it follows that £V a, = £aj + bp for some 
b G N, as desired. □ 

Remark 2.7. A vanishing sum of mth roots of unity is said to be minimal if no 
proper subsum of it is also vanishing. In general, the problem of determining the 
minimal vanishing sums is difficult (both in characteristic and in characteristic p). 
Under the hypothesis of (2.6), however, this problem can be solved. In fact, the 
argument presented in the proof above can be used to show that, in the setting of 
(2.6), the minimal vanishing sums of mth roots of unity are, up to multiplication 
by a power of (: (1) p • 1 = 0, and (2) 1 + a + • • • + c/ -1 = 0. (Of course, this 
implies that W p (m) = Np + N£.) For this conclusion, however, the assumption on 
the irreducibility of <& m (X) modulo p is essential, as the examples (2.1), (2.4) and 
(2.5) show. (In (2.1) and (2.4), $ m (X) splits completely modulo p, and in (2.5), 
$ m (X) splits into the product of eight irreducible factors of degree 9 in F P [X].) 



3. The Case gcdQo- 1, m) > 3 

In dealing with W p (m), our main goal is to find good estimates for integers n such 
that [no, oo)z Q W p (m). We begin our analysis with the case when gcd(p— 1, m) > 3. 
This case turns out to be fairly easy if we use the right tools from additive number 
theory modulo p. It will be convenient to use the following notations. For a subset 
A in a field, we shall write \A\ for the cardinality of A, and for any integer n > 1, 
we write n * A for the set A + ■ ■ ■ + A with n summands of A. 

Theorem 3.1. Assume that m := gcd(p — 1, to) > 3 and let do = (p — 1)/too- 
Then [d + l, oo) z C W p (m ) C W p (m). 

Proof. Since mo\(p — 1), the group H of TOoth roots of unity in F p is exactly 
Fp° and has exactly to elements. We claim that \n * H\ > nm for n < d , and 
\n * H\ — p for n > d + 1. It suffices to prove this for n — 1, 2, • • • , d + 1 (for, 
once we show that | (do + 1) * H\ — p, then (do + 1) * H — F p , and this implies that 
(do + i) * H = F p for any % > 1). We proceed by induction on n, the case n = 1 
being clear. Assume that \n * H\ > nmo where n < d . By the Cauchy-Davenport 
Theorem (see [Ma: Cor. 1.2.3]), \(n + 1) * H\ is either p (and hence > (n + 1)to ), 
or else 

\(n + 1) * if | > \n * if | + | if | - 1 > {n + l)m - 1. 
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In the latter case, \(n + l)* H\{0}\ > (n+l)m -2. Since H acts on (n+l)*H\{0} 
by multiplication, |(n + 1) * H \ {0}| is a multiple of itlq. Since itlq > 3, we must 
therefore have \{n + 1) * H \ {0}| > (n + l)m , which gives what we want. This 
proves our claim for n < d . In particular, \do * H\ > domo = p — 1. By the 
Cauchy-Davenport Theorem again, (d + 1) * H must be F p , for otherwise we would 
have 

| (d + 1) * H\ > \d * H\ + \H\ — 1 > c?o m o + m o — 1 = V + ( m o — 1) > P? 

a contradiction. This completes our inductive proof. Thus, for any n > do + 1, we 
have G n * H. This means that n G W^(m ), and so [d + 1, oo) z C Wp(m ) C 
Wp(m). ' □ 

Example 3.2. In many cases Theorem 3.1 gives the best result. For instance, if 
p = 3 (mod 4) and m = (p — l)/2 > 3, then d — 2 and we have <i ^ W p (m) 
since m is odd. Even in the case p = 1 (mod 4), the Theorem may still give the best 
result. For instance, if p = 13 and m — 4, then d = 3 and G = {±1, ±8}. By a 
simple calculation, 3 * G = F 13 , so again do = 3 ^ W p (m). On the other hand, if m 
is divisible by two distinct primes p±, P2, then the fact that pi, P2 G Wp(ra) implies 
that [no, oo)^ C VFp(m) for n := (pi — l)(p2 _ 1) (see [LeV: p. 22, Ex.4]). In case 
the number d in (3.1) is "large", [n , oo) z C VF p (m) will of course give a better 
result. 

We can now derive the first case of Theorem 1.3. 

Corollary 3.3. Let K = ¥ p k ba a finite field containing all m th roots of unity, and 
let d = (p k —l)/m. If m := gcd(p— 1, m) > 3, then [d+1, oo)z Q W p (m ) C W p (m). 

Proof. Say p — 1 = m do and m = m mi. Then 
a = = do 



m mi 



Since gcd(d , mi) = 1, the fraction on the RHS above is an integer. Therefore, we 
have d | d, and the desired conclusion follows from Theorem 3.1. □ 



4. The Case gcdQo - 1, m) = 2 

We shall assume throughout this section that gcd(j> — 1, to) =2 (and as before 
to > 3). In particular, p is odd and to is even. In this case, W p (m) contains 2N 
and is stable under addition by 2. Thus, once we have an odd integer n G W p (m), 
we will have automatically [n — 1, oo)^ C VF p (m). This observation will be used 
without further mention in the following. 

Let K = W p k be any finite field containing the group G of all mth roots of 
unity. The following result gives a somewhat sharper form of Theorem 1.3 in the 
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case gcd(p — 1, m) = 2 (in that the index [K : G] itself is shown to be a weight, 
with a minor exception). 

Theorem 4.1. Assume that gcd(p — 1, m) — 2, and let d = [K : G] = (p k — l)/m. 
Then [d, oo)% C Wp(ra) unless p = 3 and m = 3 fc — 1 ? in which case [d+1, oo)i C 
W 3 (m). 

Proof. Let us first check the Theorem when m — 4. In this case, the assumption 
gcd(p — 1, m) =2 implies that G is not contained in F p , so k > 2. If p > 3, then 
d > (p 2 — l)/4 > p, and we have [d, oo)^ C [p, oo)^ C VF p (m). If p = 3, then 
d > (9 — l)/4 = 2, and we have again [d, oo) z C [2, oo)^ C ^ 3 (m) (since W^m) 
contains both 2 and 3). In £/ie following, we may therefore assume that m > 6. 

Write m = 2m', so that 

, 42 , rf _ (p - l)(p fe ~ 1 H hp+l)_p-l p fc_1 H hp+1 

^ ' ' ~ 2m~' ~~ ~~ 2 ^ ' 

Since gcd(m', (p — 1)/2) = 1, we have TO'|(p fc-1 H hp+1). If m' < p fc_1 H hp+1, 

the second factor on the RHS in (4.2) is > 2, so d > p — 1. Since p G Wp(ra), we 
have [d, oo)^ C [p — 1, oo)z Q W p (m), as desired. Therefore, in the following we 
may assume that 

(4.3) m' =p fc_1 H hp + 1, and d=(p-l)/2. 

In this case = F p _i-G, so any coset of G in K has a "scalar" representative. We 
fix a generator ( for the group G, and try to put a lower bound on the cardinality 
of the set A := F p n 2 * G. 

Recalling that m > 6, write 

(£ - \)G = ai G , (C 2 - 1)G = a 2 G , and (C 4 - 1)G = a 3 G where a { G F p . 

Clearly, ±aj G A, since —1 G G. Fzrst let us assume that these three G-cosets in 
K are different. Since Oj 7^ — a,, {0, ia^} are seven different elements of A. (In 
particular, p > 7 here.) Applying repeatedly the Cauchy-Davenport Theorem in F p , 
we see that \n * A\ > min{p, 6n + 1}. It follows that 

n > (p- l)/6 =^n*A = F p ^-lGn*A^2n + lG W^(m). 

This yields 2|~^p] + 1 G W p (m) (where [•] denotes the ceiling function). Writing 
p in the form 6t ± 1, we see easily that 2[ 2 ^-] = f 2 ^-]. Thus, in this case, we get 
the stronger conclusion that [ (p — l)/3, oo)% C W p (m). 

From now on, we may assume that the three cosets {g^G} above are not all dif- 
ferent. If a±G = a 2 G, then ( 2 — 1 = (( — for some i, and so ( l = ( + 1. Since 
m is even, this shows that 3 G W p (m), and so [2, oo)z C W p (m). We have cer- 
tainly no problem in this case (except when d = 1, which occurs only when p = 3). If 
a 2 G = a 3 G, we can finish similarly. Now assume a x G = a 3 G. Here, C 4 — 1 — (C — 1)C^ 
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for some j, so C J = C 3 + C 2 + C + 1- As before, this gives 5 G W p (m). If p > 7, then 
d — (p — l)/2 > 5, and we have what we want. Thus we are only left with the cases 
p = 3, 5, 7. 

If p = 3, we have d — (p — l)/2 = 1 (and m = 3 fc — 1 by (4.3)). In this case the 
desired conclusion is [2, oo) z C W 3 (m), which is true since 2, 3 G W 3 (m). 

If p = 5, then d — 2 G W^m). In this case we need to show that 3 G W 5 (m). 
If aiG = g^G, we are done as before. Otherwise, one of these cosets must be the 
identity coset G (since [K : G] — d — 2), and this implies again that 3 G W^(m). 

Finally, we treat the case p = 7. Here we must show that [K : G] = d = 3 is in 
the weight set W 7 (m). We first note that: 

(4.4) // (C - l)G = (C - l)G = (C +1 - 1)G for some i > I, then 3 G W 7 (m). 

Indeed, if we write C - 1 = (( - l)( r and - 1 = (C - 1)C then ( r = 
-1 + • • • + C + 1 and C = + • • • + C + 1 imply that ( s = C + ( r , so 3 G W 7 (m). 
Now let C, C be the two nonidentity cosets of G in i^. By reasonings we have 
used before, we may assume that (( 2 — 1)G = C and (( — 1)G = (C 4 — l)G = C. 
Noting that 3 is prime to m (since gcd(p — 1, m) = gcd(6, m) = 2), we may also 
assume, in view of (4.4), that (£ 3 — 1)G = C. Replacing ( by ( 3 , we may further 
assume that (( 9 — 1)G = C. Next, note that since m > 6, it cannot divide 10, 
so ( 10 7^ 1. Thus, in view of (4.4), we may assume that (( 5 — l)G = C , and hence 
that (C 10 - 1)G = C Now we have C = {( - 1)G = (C 9 - 1)G = (C 10 - 1)G, so 
3 G W 7 (m) once more by (4.4). □ 

5. The Case gcdQo - 1, m) — 1 

Throughout this section, we shall assume that gcd(p — 1, m) — 1 (and as before, 
m > 3). The analysis of the weight set W^,(m) in this case turns out to require the 
hardest work. 

The assumption that gcd(p — 1, m) — 1 means that the only mth root of unity 
in F p is I. Therefore, upon factoring the polynomial X m — 1 modulo p, we have 

(5.1) X m -l = (X-l) gi (X)g 2 (X)--- , 

where the & 's are irreducible monic polynomials in F p [X], each of degree > 2. Let 
£ := min{deg(gj)}. This integer £ will play an important role in finding the estimates 
on W p (m) in this section, so let us first note a few other characterizations of it. 

Recall that the cyclotomic polynomial $ n (X) G 7h\X\ factors modulo p into a 
product of irreducible factors each of degree given by the order of the element p in 
the unit group U(Z/nZ) (see, e.g. [Gu]). Since X m - 1 = U n \ m ®n{X), it follows 
that £ is the minimum of the orders of p in (7(Z/nZ) for n ranging over the divisors 
of m greater than 1. From this, we see that £ is also the minimum of the orders of 
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p in C/(Z/gZ) for q ranging over the prime divisors of m. It is now an easy exercise 
to check the following: 

(5.2) t = min{e > 1 : gcd(p 6 - 1, m) > 1}. 

This simply means that ¥ p e is the field with the smallest extension degree over ¥ p 
which contains an mth root of unity other than 1. This can also be verified directly 
from the definition of t. 

For the rest of this section, let L := ¥ p t, m' := gcd(j/ — 1, m), and let H be 
group of m'th roots of unity in L. By (5.2), \H\ — w! > 2. It will be important 
to work with the set T := tr (if) where "tr" denotes the field trace from L to F p . 
The next theorem gives a description of W p (m) in terms of £ and the cardinality 
t := \T\ (under the standing assumption that gcd(p — 1, m) — 1). 

Theorem 5.3. Let t and t be as defined above, and let n := ■ Then 

[in, oo) z QW p {m')QW p {m). 

Proof. Applying the Cauchy-Davenport Theorem to the subset T in ¥ p , we have 
\2*T\ > min {p, It — 1}, and inductively \i*T\ > min {p, it — (i — 1)}. By the defi- 
nition of n, we have n(t — 1) > p — 1, so nt — (n — 1) > p. Therefore, \n * T\ = p. In 
particular, for every j > 0, there exists an equation ti + - ■ • + t n — — j 1 e ¥ p , where all 
ti G T. Now each U G ti(H) is a sum of £ elements of H , so ti H — • + t n + j • 1 = 
is a vanishing sum of m'th roots of unity of weight in + j. This shows that 
[in, oo) z C W p (m') C VF p (to), as desired. □ 

Note that the above theorem is meaningful only if we know that the trace set 
T C F p has at least two elements. Fortuitously, this is always the case, according to 
the following result. 

Trace Lemma 5.4. In the notations of (5.3), t — \T\ > 2. 

The proof of this lemma will be postponed to the last section (§6). We shall first 
assume this lemma and try to get to the main conclusions of this section. Note that 
the larger the trace set T is, the better bound on W p (m) is given by (5.3). Since 
t > 2 by (5.4), we have in any case: 

Corollary 5.5. [£(p-l), oo) z C W p (m') C W p (m). 

Now consider any field K = ¥ p k containing the group G of all mth roots of unity. 
Clearly, L C K, and H = L n G. Let d= (p k - l)/m and d' = (p e - l)/m'. We 
now proceed to the proof of the following, which is a stronger version of Theorem 1.3 
in the case treated in this section. 

Theorem 5.6. Assume that gcd(p — 1, m) = 1. Then \d', oo)i C W p (m') and 
[d, oo)z Q: W p (m) except in the following two special cases: (A) d' = p — 1; (B) 
p — 2, d' = 3, anrf m' = 5. In these special cases, we have [d' + 1, oo)^ C W p (m') 
and [d + 1, oo) z C W p {m). 
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Proof. Since d' — [L : H] — [LG : G] divides d = [K : G] and m! \ m, it suffices to 
prove the theorem for W p (m'). Let 

(5.7) s = (p e ' 1 + ---+p+l)/m', 

so that d' = s(p—l). First let us treat the special case (A), where we have s — 1. Here 
we are supposed to prove that [p, oo)^ C W p (m'). Since now [L : H] — p — 1 (and 
HC\W P = {1}), we have L = H -¥ p . Therefore, fixing a primitive m'-th root of unity 
a G H, we have a — 1 = b~ x a l for some integer % and some 6 G F p . For convenience, 
let us think of b as an integer in [1, p — 1]. Multiplying 6 • 1 + (p — 6) • 1 = G F p 
by a and using the relation ba = b + a\ we get 

= ba + (p - 6) a = b ■ 1 + a 1 + (p - 6) a, 

which is a vanishing sum of weight p + 1. Multiplying this by a again and repeating 
the argument, we get vanishing sums (of m'-th roots of unity) of weight p + % for 
any % > 0. Coupled with p G W p (m'), this gives [p, oo) z C W p (m'), as desired. For 
the rest of the proof, we may assume that s > 1 . We claim the following: 

Lemma 5.8. s > 1 implies that s > £, except perhaps when p = 2 and £ = 4, 6, 8, 9. 

Thus, leaving aside the four special cases, we have d' = s(p — 1) > £ (p — 1), so 
the desired conclusion for W p (m') in (5.6) follows from (5.5). The four special cases 
will have to be treated later. 

Proof of (5.8). We go into the following two cases. 

Case 1. £ is prime. We claim that £ < q for any prime q \ s (and therefore £ < s). 
In fact, from (5.7), we get p 1 ' 1 + • • • + p + 1 = (mod q), so p e = 1 (mod q). If 
p ^ 1 (mod q), then p has order £ in [/(Z/gZ) (since £ is prime), and so £\ (q— 1). 
In this case £ < q — 1 < q. If p = 1 (mod q), then from p l ~ l + -- -+ p+ l = £ = 
(mod q), we have in fact £ = q. 

Case 2. £ is composite. Let q be the smallest prime divisor of £ and write £ = qt. 
Then 1 < t < £ and (5.2) implies that gcd(p* — 1, m) — 1. Since (p* — 1) | (p e — 1), 
we see that p* — 1 divides (p e — l)/m' = s(p — 1). Thus, s > (p* — l)/(p — 1). We 
shall now exploit the following elementary fact which is easy to prove using calculus: 

Lemma 5.9. p x > (p— l)x 2 + 1 for every x G [2, oo)^ £/ie exception of p = 2 
and x = 2, 3, 4. 

Applying this lemma to x = t, we get the desired conclusion 

— 1 

s > >t 2 >qt = £, 

p — 1 

except when p = 2 and t — 2, 3, 4. If t = 2, we have g = 2 so £ = 4. If t — 3, we 
have g = 2, 3, so £ = 6 or 9. Finally, if t — 4, we have g = 2 so £ = 8. This proves 

(5.8) , but we still have to complete the proof of (5.6) in the four special cases noted. 
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In these cases, d' = {2 l — l)/m', so both m', d' are odd (and > 1). We may 
assume that d' < ml. (For, if d' > m', we have [m', oo)z Q W2(m') since m' is 
odd, and hence [d 1 ', 00)1 C W2(m').) We simply have to check the four outstanding 
cases individually. 

(1) £ = 4. Here 2 l — 1 = 15, so d' — 3, to' = 5, and we are in the case (B) of 
(5.6). The desired conclusion in this case is [4, oo)z Q W^to'), which is true since 
5 = rri e W 2 (m'). In fact, by (2.6), we have W 2 (5) = 2 N + 5 N = {0, 2} U [4, oo) z . 
In particular, 3 ^ 1^(5), so this case is truly exceptional. 

(2) £ = 6. Here 2 l - 1 = 63, so we have either d' = 7, to' = 9 or d' = 3, m' = 21. 
In both cases, [2, oo) z C Vl^m') (since 2,3 G W^(m')), so there is no problem. 
(Actually, in the case d' — 7, we are in the good case s = d' > £ already.) 

(3) £ = 8. Here 2^ — 1 = 255, so we have either d' — 5, m' = 51 or d' — 15, m' — 
17. The latter case presents no problems, since we are once more in the good case 
s = d' > £. In the former case, 3 | ml implies that ^(m') = [2, oo)z, so again there 
is no problem. 

(4) £ = 9. Here 2 £ - 1 = 511, so d' = 7, m! = 73. We have shown in (2.5) that 
H^(73) = {0} U [2, oo)z, so there is no problem. 

This finally completes the proof of Theorem 5.6. 

6. Traces of m th Roots of Unity 

In §5, we stated without proof the Trace Lemma 5.4, which was crucial for the 
proofs of (5.5) and (5.6). In this section, we return to the trace set T = tr(if), and 
offer a general analysis of T which we believe to be of independent interest. The 
proof of the Trace Lemma is an easy by-product of this general analysis. 

The notations (and hypotheses) introduced at the beginning of §5 will remain in 
force. In particular, H is the group of m'th roots of unity in L = ¥ p e, and "tr" is 
the field trace from L to ¥ p . To enumerate the elements in T, let 

(6.1) X m ' - 1 = (X- l)h x (X)---hr(X) 

be the factorization of X m ' — 1 into (monic) irreducibles over ¥ p . Then deg hi > £ 
by the definition of £ (and the fact that (X m ' — 1) | (X m — 1)). On the other hand, 
since L contains all m'th roots of unity, each hi(X) splits completely in L, so 
deg hi < [L : F p ] = £. Therefore, deg hi = £ for all i. Let 

(6.2) hi(X) =X e - a % X 1 - 1 + • • • , 

and let {a^} be all the roots of hi(X) in L. For each h i: we can identify the field 
¥ p [X]/(hi(X)) with L by the correspondence X <-> aij (for any j). Therefore, 
tr(a!jj) = J2 k Oiik = cii for all i, j. We have thus 

(6.3) T = {tr(l), ai, • • • , a r } = {£, a u • ■ ■ , a r }, 
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with possible duplications. 

It is now easy to prove the Trace Lemma 5.4, which asserted that \T\ > 2. Assume, 
for the moment, that T is a singleton. Then, by (6.3), a, = £ for all %. Summing 
all roots of the polynomial in (6.1) (and recalling that w! > 2), we get 

= 1+ aH ha r = l + r£ = m' e¥ p , 

contradicting the fact that m' is prime to p. 

The equation (6.3) gives an upper bound \T\ < 1 + r, and this becomes an equality 
iff the elements listed in (6.3) are distinct. This is the case, for instance, if £ = 2. To 
see this, note that the constant term of each hi(X) in (6.1) is 1, since it is an mth 
root of unity in F p , and we are assuming that gcd(p— 1, m) = 1. Therefore, if £ = 2, 
we have hi(X) = X 2 — aiX + 1. Since the hi 's are distinct, so are the Oi 's, and of 
course at ^ 2 (since otherwise hi(X) = (X — l) 2 ). Therefore, \T\ = 1+r, and (5.3) 
gives the pretty good estimate [2n, oo)z Q W p (m') C W p (m) with n = I" 2 — For a 
simple example of this, let p = 5, and m = 3. Here m! = 3, £ — 2, r = 1, L = F 2 5, 
and |T| = 2. By (2.6), W 5 (3) is the set 

3N + 5N= {0, 3, 5, 6}U [8, oo) z . 

Since 2n = 8, the conclusion in (5.3) is sharp here. 

In general, t := \T\ may be less than 1 + r, since there may be duplications among 
the elements of T listed in (6.3). For an example where interesting duplications 
occur, take p = 7 and m = 19. Here m' = 19 and £ = 3, r = 6. Mathematica 
gives a factorization 

X 19 -l = (X- l)(X 3 + 2X + 6)(X 3 + 4X 2 + X + 6)(X 3 + 4X 2 + 4X + 6) 

• (X 3 + 5X 2 + 6)(X 3 + 3X 2 + 3X + 6)(X 3 + 6X 2 + 3X + 6) G ¥ 7 [X\. 

Since —4 = 3 = tr(l) in F 7 , T has only five (two less than 1 + r = 7) distinct 
elements {0, 1, 2, 3, 4}. In this case, the number n in (5.3) is \ = 2, and (5.3) 
shows that [6, oo)z Q 1^7(19). Note that, in spite of the trace duplications, this is 
still much sharper than what is given in (5.6). 

In general, we cannot hope to improve upon the lower bound |T| > 2. For one 
thing, F p may have only two elements to begin with. Also, we may have r = 1, in 
which case (5.4) and (6.3) show that |T| = 2. Even if p > 3 and r > 2, there are 
many cases in which T is just a doubleton. Let us illustrate the situation r = 2 by 
taking m to be an odd prime q (so that m! = q too), and assuming that p is also 
odd and has order £ = (q — l)/2 in the group U(Z/qZ). In this case, r = 2, and 
(6.1) becomes 



(6.4) 



x q - 1 = (x - i)/i!(x)/i 2 po 
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where hi, hi are monic irreducible (over F p ) of degree £. Following a standard 
notation in number theory, let us define q* to be q if q = 1 (mod 4), and q* to be 
— q if q = 3 (mod 4). Then the size of the trace set T = tr(H) is determined as 
follows. 

Proposition 6.5. Under the above assumptions \T\ = 2 iff p\ (q* — l) (and \T\ = 3 
otherwise). 

Proof. Let E = Q(£) where C = e 2 ™ /<? , and fix a generator t of U(Z/qZ). Then 
(J : C l—> C* is a generator for Gal(F/Q), and a 2 : £ i— > is a generator for 
Gsl(E/F), where F is the fixed field . Note that X 9 " 1 + • • • + X + 1 factors 
into f(X)g(X) over F[X], where 

/(X) =X l -aX l ~ l + ■■■ and g{X) = X e - bX*' 1 + • • • 

are, respectively, the minimal polynomials of ( and over F. We have 

a - 6 = tr £/F (C) - tr E/F (C<) = ^ " E e ^ 

This is precisely the quadratic Gauss sum (with respect to the Legendre character 
on W q ), so by [IR: (8.2.2)], a — b = y/q* . (Gauss showed that the ^fq* here is the 
one taken in the upper half plane if q = 3 (mod 4), but this will not be needed in the 
following.) Since we also have a + b = —1, it follows that 

a = (v^-l)/2, 6 = -(v^+l)/2. 

Incidentally, this proves the well-known fact that F = Q(^/q*). 

Let R be the ring of algebraic integers in F. Since p is unramified in E, it is 
also unramified in F, so pR = pp' , where p, p' are distinct prime ideals of R, 
both of residue degree 1. Identifying R/p with F p , we may take the polynomials 
hi, hi in (6.4) to be / and g, where "bar" means reduction modulo p. In particular, 
T = {£, a, b} by (6.3). Here the two elements a, b are always different (for otherwise 
p would contain (a — b) 2 = q* as well as p). Therefore, T will have only two elements 
iff p also contains 

4(e -a)(e-b) = (q-V¥)(q + V¥) = q 2 -q* = q*(q* - 1)- 

Since q* ^ p, this happens iff q* — 1 G p, that is, iff p \ (q* — 1), as claimed. □ 

Corollary 6.6. Let q = U + 1 and p = 2£' + 1 be distinct primes such that the 
order of p is £ modulo q. If p \ (q* — 1), then [2££', oo)i C W p (q). Otherwise, 
[££', oo) z C W p (q). 



Proof. This follows from (5.3) and (6.5), since the number n in (5.3) is 2£' in the 
first case, and £ ' in the second case. □ 
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Example 6.7. Let q — 11 (with q* = —11). Then the primes 3 and 5 both have 
order £ = (q — l)/2 = 5 modulo q, and according to Mathematical 

X u - 1 = (X - 1)(X 5 + X 4 + 2X 3 + X 2 + 2)(X 5 + 2X 3 + X 2 + 2X + 2) G ¥ 3 [X], 
X u -1 = (X-l)(X 5 +2X 4 +4X 3 +X 2 +X+4)(X 5 +4X 4 +4X 3 +X 2 +3X+4) G F 5 [X]. 

Thus, for p = 3, T = {£, -1, 0} = {2, 0} in F 3 . This is consistent with (6.5) since 
p = 3 divides q*-l = -12. Here, £ = 5, V = 1, so (6.6) gives [10, oo) z C W 3 (ll). 
(In fact, from 0, 2 G T, we see easily that 5,6 6 W 3 (ll), and so 8, 9 G W 3 (ll) 
also.) On the other hand, if we choose p — 5, then T = {£, —2, —4} = {0, 3, 1} 
in F 5 , consistently with (6.5) since p = 5 does not divide q* — 1 = —12. Here, 
£ = 5, £' = 2, so (6.6) gives again [ 10, oo) z C W 5 (ll), and 0, 1, 3 G T show further 
that 5, 7, 9 G W 5 (ll). 

The arguments in the proof of (6.5) can be generalized. However, if the order of 
p modulo q is smaller than (q — l)/2 (in other words r > 2), the computations of 
the trace elements in T will involve Gaussian sums with (higher) character values as 
coefficients. We shall not go into this analysis here. We should point out, however, 
that if q is fixed, then the prime ideal method (in characteristic 0) used in the proof 
of (6.5) will suffice to show that the upper bound |T| < 1 + r becomes an equality 
for sufficiently large p. Therefore, by (5.3), [in, oo)% C W p (q) with n = \^- \ , for 
sufficiently large p. 
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